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Abstract — An achievable rate region, based on lattice interfer- 
ence alignment, is derived for a class of time-invariant Gaussian 
interference channels with more than two users. The result is 
established via a new coding theorem for the two-user Gaussian 
multiple-access channel where both users use a single linear 
code. The class of interference channels treated is such that 
all interference channel gains are rational. For this class of 
interference channels, beyond recovering the known results on 
the degrees of freedom, an explicit rate region is derived for finite 
signal-to-noise ratios, shedding light on the nature of previously 
established asymptotic results. 

Index Terms — Multiple access channel, linear codes, interfer- 
ence channel, interference alignment. 

I. Introduction 

An important open problem in network information theory 
is determining the capacity region of the interference channel. 
The interference channel is a communication model where 
multiple pairs of transmitters and receivers utilize the same 
communication medium. As a result, each user receives the 
output of a multiple-access channel, i.e., it suffers from 
interference from transmissions intended for other users. 

An important special case of this channel model is the 
Gaussian interference channel, where each receiver sees a 
linear combination of its intended signal and the signals 
transmitted by the interfering users plus an additive white 
Gaussian noise (AWGN). For the case where only two users 
are sharing the same medium, i.e., the interference at each 
receiver is generated by only one user, the capacity region was 
characterized up to half a bit only recently |fl]. The achiev- 
ability part utilizes the Han-Kobayashi O scheme which is 
shown to be nearly optimal in the two-user case. The results 
of HI are rather disappointing in the sense that they imply 
that for a wide range of channel parameters, either treating 
the interference as noise, or alternating access to the medium 
(i.e., time sharing) between the two transmitter-receiver pairs, 
is a reasonable approach. In particular, time sharing yields the 
maximal degrees of freedom (DoF) afforded by the channel 
(i.e., one), where the number of DoF is defined as the ratio 
between the maximal possible sum rate and 1/2 log(SNR) in 
the limit where the SNR goes to infinity. 

An interesting aspect of the interference channel is that the 
two-user case does not capture the quintessential features of 
the general ( if-user) interference channel, as has recently been 
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demonstrated in the framework of linear Gaussian interference 
channels. In particular, while one may have suspected that the 
channel would be interference limited, i.e., that time sharing 
would be optimal at high SNR, it has been demonstrated that 
this is not the case. Rather, it has been shown [3|-[6| that 
the correct "extension" of the two-user case is that in general, 
K/2 DoF are afforded by the if-user Gaussian interference 
channel. 



A. Related Work 

The works of [3|-[6| have revealed that the Han-Kobayashi 
approach is inadequate for K > 2, and a new approach, 
namely, interference alignment, was needed to achieve the DoF 
afforded by the (general) if-user interference channel. 

The concept of interference alignment was introduced by 
Maddah-Ali et al. in Q in the context of the multiple-input 
multiple-output (MIMO) X-channel, and then by Cadambe and 
Jafar in for the if -user Gaussian interference channel with 
time-varying gains. The idea behind interference alignment in 
its original form, i.e., for time-varying channels, is essentially 
to split the signal space observed by each receiver into two 
orthogonal subspaces, where all the interference signals are 
confined to one, while the intended signal occupies the other. 
That is, if at each receiver (simultaneously for all receivers) the 
interferences can be forced to lie in a subspace of dimension 
roughly half of that of the signal space, the rest of the 
signal space is free of interference and can be used for 
communicating the intended messages and hence roughly K/2 
DoF can be utilized. 

For the case of time or frequency varying channels, under 
some mild ergodicity conditions, this approach achieves the 
maximum possible DoF. Specifically, the interference align- 
ment scheme of |3| achieves K/2 degrees of freedom while 
in |£S) it was shown that for a fully connected K-user interfer- 
ence channel, K/2 is also an upper bound on the number of 
DoF. It was later shown in [4] that for a large family of time or 
frequency varying channels, each user can achieve about half 
of its interference-free ergodic capacity even at finite SNR. 

The results of |3| and [4 J are encouraging in the sense 
that they imply that the if -user Gaussian interference channel 
is not inherently interference limited, a result that was quite 
unanticipated from the studies on the two-user case, which had 
been the focus of nearly all studies of Gaussian interference 
channels for nearly three decades. 

The focus of this paper is the real time-invariant (constant 
channel gains) A'-user Gaussian interference channel, for 
which another from of interference alignment has proven to 
play a key role as well. In this case, it was shown in [5|, |6| 
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that by taking the transmitted signal to belong to the integer 
lattice, it is possible to align the interference so that it remains 
confined to this lattice. As a result, the minimum distance of 
the received constellation at each receiver does not decrease 
with K, and when the SNR approaches infinity, each receiver 
can decode its intended signal with rate ss 1/4 log SNR, 
yielding a total of K/2 DoF. Thus, linear constellations, i.e., 
a PAM constellation in the one-dimensional case, play a key 
role in interference alignment for time-invariant channels. 

Specifically, it was shown in [5| that if at each receiver, 
the channel gains corresponding to the interferers are rational, 
whereas the direct channel gains corresponding to the intended 
signal are irrational, K/2 degrees of freedom are achievable. 
Even more interestingly, the authors of [5| have shown that 
if the direct channel gains are rational as well, the degrees of 
freedom of the channel are strictly smaller than K/2. Later, 
the authors of [6| proved that the DoF of the time-invariant 
interference channel are K/2 for almost all sets of channel 
gains. 

As noted above, linear/lattice codes play a key role in 
coding for time-invariant Gaussian interference channels. This 
feature of the K-usei Gaussian interference channel is shared 
with a growing number of problems in network information 
theory where lattice strategies have been shown to be an 
important ingredient. There are several examples where lattice 
strategies achieve better performance than the best known ran- 
dom coding strategies. In particular, Philosof et al. introduced 
lattice interference alignment in the context of the doubly-dirty 
multiple-access channel [9], i.e., to a Gaussian multiple-access 
channel with multiple interference signals, each known to a 
different transmitter. Other network scenarios where lattices 
play a key role are the two-way (or multiple-way) relay 
problem [101 and in the compute & forward approach to 
relay networks ifTTl . 

Lattice interference alignment for the interference channel 
was first proposed by Bresler et al. in lfP21 . ifPTl . where 
an approximate characterization of the capacity region for 
the many-to-one and one-to-many interference channels was 
derived. Lattice interference alignment was later utilized by 
Sridharan et al. in lTT4l where a coding scheme where all users 
transmit points from the same lattice was introduced. If at 
each receiver all the gains corresponding to the interferers are 
integers, the sum of the interferences is a point in the same 
lattice and thus the interference from K— 1 users is confined to 
one lattice. Under very strong interference conditions, which 
are defined in [14| and play the same role as the well-known 
very high interference condition ifTSl for the two-user case, the 
decoder can first decode the sum of interferers while treating 
the desired lattice point as noise, then subtract the decoded 
interference, and finally decode the intended codeword. Later, 
in |fl6l , this scheme was combined with a layered coding 
scheme in order to show that lattice interference alignment 
can yield substantial gains and, in particular, achieve more that 
one DoF in some cases for a broader (but still quite limited) 
class of channels. 

The works of ITMl and lfl6l allowed for important progress 
towards the understanding of interference alignment for fi- 
nite SNR. Nonetheless, these results are limited since they 



essentially rely on using superposition with a judicious choice 
of power allocation such that a very strong interference 
condition holds, in conjunction with successive decoding. In 
the decoding procedure, a single layer is decoded at every 
step while the other layers are treated as noise. At each step 
of the successive decoding procedure, the decoder sees an 
equivalent point-to-point channel where lattice codes are used. 
The performance of lattice codes over AWGN point-to-point 
channels are well understood, and therefore the scheme of [14] 
and 1 16 1 can be analyzed with relative ease. For special classes 
of channel gains, it is possible to design a layered codebook 
that is simultaneously good for all receivers. For a wide range 
of channel parameters, however, such a layered scheme is not 
beneficial, as is also noted in [16]. 

B. Summary of Results 

The main contribution of the present work is in providing 
a general framework for lattice interference alignment that 
is not confined to successive decoding. A coding theorem is 
established for a multiple-access channel where all users use 
the same linear codebook. 

Specifically, if the interference is aligned to a lattice, but the 
very strong interference condition is not satisfied, the decoder 
can still perform joint decoding of the interference codeword 
and the desired codeword. A major obstacle however arises 
when one attempts to jointly decode multiple codewords: The 
alignment of all interferers into one lattice point, which occurs 
simultaneously at all receivers, is only possible due to the 
fact that all users transmit lattice points from the same lattice. 
Thus, if joint decoding is applied, each decoder sees a two-user 
Gaussian multiple-access channel (MAC) where both users use 
the same linear code, for which, to the best of the authors 
knowledge, no previous results are known. 

The fact that the number of DoF of many families of 
interference channels is K/2 implies that it is sufficient to 
transform the channel seen by each receiver into a two-user 
MAC in order to achieve optimal performance, i.e., half of the 
resources are dedicated to the interference and the other half 
to the intended signal. In light of this observation, along with 
the proven advantages of lattice codes in creating alignment 
between different users, it is important to study the achievable 
performance of the two-user Gaussian MAC where the same 
lattice code is used by all transmitters. 

In this paper, we first address the question of finding an 
achievable symmetric rate for the Gaussian (modulo-additive) 
MAC with two users that are "forced" to use the same 
linear code. We then employ this new ingredient in order to 
analyze an interference alignment scheme, suitable for a class 
of interference channels, which we refer to as the integer- 
interference channel, where all cross gains are integers (or 
rationals). The analysis is not asymptotic in the SNR. 

While the proposed coding scheme does not require asymp- 
totic conditions, we show that it is asymptotically optimal 
in a DoF sense, i.e., it achieves K/2 degrees of freedom 
for the integer-interference channel provided that the direct 
gains are irrational. The achievable rate regions enables to 
shed light on the "mystery" around the effect of the direct 
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channel gains being rational or irrational, which has to date 
only been understood for asymptotic high SNR conditions. 
In the proposed scheme, rational direct channel gains of the 
form r/q limit the achievable symmetric rate to be smaller 
than logg, which is not a serious limitation if q is large and 
the SNR is moderate, but does indeed pose a limitation in the 
limit of very high SNR. 

Moreover, previous results [0, |]6] state that the DoF of 
an interference channel with integer interference gains, are 
everywhere discontinuous in the direct channel gains. Such a 
result is quite displeasing and calls into question the appli- 
cability of interference alignment for time-invariant channels 
at non-asymptotic conditions, i.e., raises questions w.r.t. the 
robustness of lattice interference alignment. The results of this 
work demonstrate the behavior of the rate when the direct 
channel gains approach a given set of rational numbers. The 
(derived achievable) rate is continuous, as is to be expected, 
everywhere in the direct channel gains for any SNR, but the 
variation, i.e., sensitivity to the direct channel gain, increases 
with the SNR. 

While the presented scheme is only valid for channels 
where all the (non-direct) interference gains are integers (or 
rationals), we believe that the results are an important step 
towards the understanding of the feasibility of interference 
alignment for general time-invariant interference channels in 
the finite SNR regime. 

The rest of this paper is organized as follows. In Section ITTI 
some notations used throughout the paper are defined. In 
Section [Hi] an achievable symmetric rate is derived for a two- 
user Gaussian (modulo-additive) MAC where both users use 
the same linear code. Section [IV] presents an interference 
alignment scheme for finite SNR. Section [V] discusses the 
effect of the direct channel gains being rational vs. irrational 
on the performance of interference alignment. In Section |VT] 
possible approaches for interference alignment when the in- 
terference gains are not restricted to be integers (or rational) 
are discussed. The paper concludes with Section IVIII 

II. NOTATIONAL CONVENTIONS 

Throughout the paper we use the following natational con- 
ventions. Random variables are denoted by uppercase letters 
and their realizations by lowercase letters. For example X is a 
random variable whereas a; is a specific value it can take. We 
use boldface letters to denote vectors, e.g., x denotes a vector 
with entries Xi. 

A number of distinct modulo operations are used extensively 
throughout the paper. The notation x mo d r Q m denotes reducing 
i£l modulo the interval [a, b). That is, x mo( j r a ,6) is equal 
to 

x — m • (b — a) 
where m € Z is the (unique) integer such that 

x — m ■ (b — a) € [a, b). 
Similarly, x mo d p where x € Z p is defined to equal 

x — m ■ p 

where m G Z is the unique integer such that x — m ■ p £ Z p . 



If x is a vector, the notation x moc j r a m is understood to 
mean reducing each component of x modulo the interval [a, b). 
We define the basic interval I as [-L/2, L/2) where L = 
\/l2. Reducing x modulo the interval X is denoted by x*, i.e., 

X* = [x] mo d [-L/2X/2)- 

The Euclidean norm of a vector x is denoted by ||x|j. The 
notation [^1 denotes rounding x to the nearest integer. We 
denote the set of all prime numbers by V. All logarithms in 
the paper are to the base 2 and therefore all rates are expressed 
in bits per (real) channel use. All signals considered in this 
paper are real valued. 

III. Achievable Symmetric Rate for the Two-User 
Gaussian MAC with a Single Linear Codebook 

A. Problem statement 

We consider the modulo-additive MAC 



Y = [X 1 + 1 X 2 + Z]* 



(1) 



where Z is an i.i.d. Gaussian noise with zero mean and 
variance E [Z 2 ] = 1/SNR. We are interested in characterizing 
the achievable rate region for this channel where both users 
are forced to use the same linear code, and where both users 
are subject to the power constraint 



n 



< 1. 



Note that a random variable uniformly distributed over I has 
unit power. 

An (n, R) code for this model is defined by one encoding 
function 



f :{!,..., 2 nR } 
and one decoding function 

g:l n -> {1,...,2" R } x {l,...,2 nR } . 

The linearity constraint on the encoding function / is ex- 
pressed by the condition that for any iwi, w 2 € {l, . . . , 2 nR }, 
there exists a w$ € { 1, . . . , 2 ni? } such that 

[f(w 1 )+f(w 2 )]*^f(w 3 ). (2) 

Specifically, user 1 chooses a message wi 6 {l, . . . ,2 nR \ 
and transmits xi = f(w\), and user 2 chooses a message 
u>2 G |1, ...,2 n ) and transmits X2 = f{w2). The decoder 
upon receiving 

y = [xi +7x2 + z]* , 

generates estimates for the transmitted messages 

{wi,w 2 } = g(y). 

The error probability for decoding the transmitted messages is 
denoted by 

P e = Pr ({u>i,w 2 } 7^ {wi,w 2 }) . 

We say that a symmetric rate R is achievable if for any e > 
and n large enough (depending on e), there exists an (n, R) 
linear code such that P e < e. 
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B. Connection to the standard Gaussian MAC and previous 
results 

The channel ([D can be viewed as a degraded version of the 
Gaussian multiple access channel 

Y = Xi+ 7X2 + Z, (3) 

as Y can be obtained from Y by the transformation 

Y = Y*. 

It follows that the achievable symmetric rate for our channel 
model with the constraint that both users use the same linear 
code is upper bounded by that of the channel (O where each 
user can use any codebook with rate R. 

The capacity region of the Gaussian MAC (O was charac- 
terized by Ahlswede [17| through the following equations 

Ri < - log (1 + SNR) 
R 2 < i log (1 + 7 2 SNR) 

Ri + R2<\ log (1 + (1 + 7 2 )SNR) . 

It follows that the symmetric capacity (i.e. the maximum 
achievable R = R% = R2) is given by 

C = min I ~ log (1 + SNR) , ~ log (l + 7 2 SNR) , 

ilog(l + (l+ 7 2 )SNR) J. (4) 

The achievable part of the capacity theorem is proved using 
two different random codebooks, whereas we restrict both 
users to use the same linear (over the group 1 with the addition 
operation) codebook. The main result of this section is the 
following theorem. 

C. Main result and discussion 

Theorem 1 (MAC with one linear code): For the setting 
described in Section ITH-AI (a two-user Gaussian MAC channel 
where both users use the same linear code), the following 
symmetric rate is achievable 

R < max min 

pev(i) 



(5) 




log - 



where 

ancQ 

P'( 7 ) 



5(p, 7) = min I 

ZGZ p \{0} 



(6) 



P e V 



3SNR 1 



< 1 - 2p ■ e 



(7) 



' Replacing the constraint p £ T 3 ' (7) with the constraint p £ V results in a 
negligible change in the rate region described by j3},|6) for values of 7 that 
are not very "close" (w.r.t. the SNR) to ±1/2. 



Discussion: Inspecting the equations describing the achiev- 
able rate region of Theorem Q] the role of the optimization 
parameter p, and the factor 5(p, 7) may seem at first strange. 
The factor 5(p, 7) is a measure of how accurately 7 can be 
approximated by a rational number with a denominator smaller 
than p. For example, if 7 is a rational number that can be 
written in the form 7 = r/q, then for any p > q we have 
Hp> 7) — 0- ^ this is the case, only values of p < q yield 
non-trivial rates in (0, which in turn implies that for any value 
of SNR the rate of (ID is smaller than log g@ 

Example: 

In order to informally explain the rate saturation phe- 
nomenon (why R < \ogq for any SNR), we consider an 
example where 7 = 1/3. Assume that user 1 transmits the 
codeword xi = f{w\) and user 2 transmits the codeword 
x 2 = f{w2)- We next observe that there are roughly 2 nR 
(up to a polynomial multiplicative term) "competing" pairs of 
codewords that can cause an error with a probability no smaller 
than (1/3)™. Using the union bound as an approximation for 
the average error probability of the code, this implies that the 
achievable rate can be no greater than log 3. 

From the linearity property of the code (fj), there exists^ a 
codeword X4 = f(wi) such that 

x 2 = [3x 4 ]* . 

Furthermore, for any codeword X5 = f(w$), which is inde- 
pendent of Xi and x 2 , there exists a codeword x 3 = /(W3) 
such that 

x 3 = [3x 5 ]* . 
Now consider the pair of competing codewords 



and 



Xj = [Xi + x 4 + x 5 j 



-x 3 J 



which exists by the linearity of the codebook. After passing 
through the channel fl}, the "distance" between the transmitted 
pair of codewords and the "competing" pair of codewords is 



U 



1 



1 



Xl + -X 2 - XJ - -X 2 



xi + -x 2 - Xi - x 4 - x 5 



-X3 



- [3x 4 ]* - x 4 



The terms 



[3x 4 



xs + g [-3x 5 ] 



x 4 



(8) 



"This phenomenon is unique to the case where the codebook used by the 
two transmitters is linear. In the case where both transmitters use the same 
random codebook, it can be shown that (at least) the cutoff rate is identical to 
that of two different random codebooks (except for the singular case where 
7 = 1 where it is impossible to distinguish between the users) 

3 Here we assume that the mapping [3x]* gives a different result for every 
x £ C, as is the case for the ensemble of codebooks we consider in the 
sequel. 
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and 



SNR=20dB 



1 



-3x 5 f 



can only take values in the set {— L/3, 0, L/3}, which implies 
that U can only take values in {— L/3, 0, L/3} as well. 

If we further assume that each of the codewords in the 
codebook has a memoryless uniform distribution over the 
interval] T, we can conclude that U has a memoryless uni- 
form distribution over {— L/3, 0, L/3}. This means that after 
passing through the channel, the transmitted pair {Xi, X2} is 
equivalent to the competing pair {XjjX^} with probability 
(1/3)™. Since there are about 2 nR possible choices for x 3 , we 
see that rates above B = log 3 would result in error with high 
probability. 

While this conclusion relies on the union bound, which 
may not be tight for some cases, since almost every pair of 
two codewords from a randomly drawn linear codebook are 
statistically independent, the union bound is in fact not a bad 
approximation in the considered problem. 

Comparison with random codebooks: 

In order to better understand the performance of our coding 
scheme, we compare the maximum symmetric rate it achieves, 
which we refer to as i?u n (SNR), with that achieved by a coding 
scheme that utilizes two different random codebooks. We refer 
to the latter symmetric rate as i? ran d (SNR) which is given by 
©. Define the normalized rate 



i(SNR) = 



i?lin(SNR) 



(9) 



i? rand (SNR)' 

Figure Q] depicts r norm (SNR) as a function of 7 £ [0,0.5) 
for a range of moderate to high values of SNR, specifically 
SNR = 20dB, 30dB and 40dB. Figure [2] depicts r norm (SNR) 
as a function of 7 € [0, 0.5) for extremely high values of SNR, 
namely SNR = lOOdB, HOdB and 120dB. 

Figures Q] and [2] demonstrate the sensitivity of the rate to the 
channel gains. For a range of "reasonable" values of SNR, the 
rate changes rather smoothly with 7. For extremely high SNR, 
however, a slight change in the value of 7 may dramatically 
change the achievable rate. 

The figures also suggest that for almost every value of 7, 
the normalized rate r norm (SNR) approaches one as the SNR 
tends to infinity. Thus, the symmetric rate achieved when both 
users are using the same linear code scales with the SNR as 
L'rand(SNR) for asymptotic SNR conditions. 

Note that Theorem Q] does not take into account shaping 
issues, since it uses a one-dimensional lattice as the coarse 
lattice. We have chosen not to pursue shaping in this paper in 
order to simplify the analysis. Moreover, the losses resulting 
from using the same linear code for both users outweigh 
the shaping loss, which can be upper bounded by a constant 
number of bits. Nevertheless, it is most likely that extending 
Theorem [T] to the case where both users use the same nested 
lattice codebook, with a "good" coarse lattice, can result in a 
higher symmetric rate for the two-user MAC. 

4 A uniform distribution over a uniform grid inside X would have the same 
effect. 
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Fig. 1 . r norm (SNR) plotted as a function of 7 for SNR = 20dB , 30dB , 40dB . 
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Fig. 2. r norm (SNR) plotted as a function of 7 for SNR 
lOOdB, HOdB, 120dB. 



D. Proof of Theorem Q] 

We first describe the process of the code generation, the en- 
coding and the decoding procedures, and then turn to analyze 
the error probability in decoding the transmitted messages . 

Construction of linear codebook ensemble: 

We begin by describing the generation process of the 
ensemble of linear codebooks considered, which is a variant of 
the well known Construction A (see e.g. [18]). A generating 
matrix G of dimensions k x n is used, where all elements 
of G are independently and randomly drawn according to the 
uniform distribution over the prime field Z p . We set 

k = n — — . 

logp 

The set C is generated by multiplying all fc-tuple vectors 
with elements from the field Z p by the matrix G (where all 
operations are over Z p ) 

C = {c = w T G wGZf 1 }. 
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We refer to the vectors w as message vectors, and note that 
there are 2 nR such vectors, each corresponding to one of the 
possible messages. 

Finally, the codebook C is generated from the set C by 
properly scaling and shifting it such that it meets the power 
constraint 



C 



C 



The ensemble of codebooks created by the above procedure 
satisfies the following properties: 

1) For any set of linearly independent message vec- 
tors, w 1; w 2 , . . . , w;, the corresponding codewords 
Xi, X2, . . . , X; are statistically independent. 

2) Each codeword X (except for the zero codeword) is 
memoryless: 



P(X)=l[P(X t ). 



t=i 

3) For any the corresponding codeword is uniformly 

distributed over the constellation 

L 



A 



p 



4) Each codeword X in the ensemble satisfies the power 
constraint 

— E [||X|| 2 1 < 1. 
n 

5) Each codebook in the ensemble satisfies the linearity 
constraint as defined in (|2). 

Encoding: 

Suppose a codebook from the above ensemble, which is 
completely characterized by the matrix G, has been chosen. 
User i uniformly draws a message vector Wj, and transmits 



(10) 



x t = — w, G 

The channel output is thus 

y = [xi + 7x2 + z]* . 

Decoding: 

Given the encoding matrix G and the channel output y, the 
decoder searches the pair of codewords {xi,Xj} for which 



is closest to y in the following sense 



hJ 



arg mm 

i,3 




vt - i>t(i,j)Y 



(ii) 



(12) 



If there is more than one pair of indices satisfying (Q~2), an 
error is declared. 

The decoder only searches over the pairs of codewords 
corresponding to message vectors {wj,w,-} that are linearly 
independent (over Z p ). This constraint on the decoder fa- 
cilitates the analysis of the error probability since it means 
that when G is assumed to be random, the decoder only 



searches over the pairs of codewords {X^Xj} which are 
statistically independent. The above constraint implies that if 
the users had chosen message vectors {wj,w,-} which are 
linearly dependent, an error event occurs. For the rest of the 
analysis we assume that indeed the chosen message vectors 
are linearly independent, and as a consequence {X^Xj} are 
statistically independent when G is assumed to be random. 
We account for the probability of the error event that occurs 
when this is not the case, in the final step of the proof. 

We note that the decision rule (fT2l is an approximation of 
the maximum-likelihood decoder which searches for a pair of 
codewords {x^ , Xj } that satisfies 

Pr(y|x if Xj) > Pr(y|x m ,x;) V(m,Z) ^ 

The suboptimal decoding algorithm we use simplifies the 
analysis, but does not incur significant losses w.r.t. the optimal 
maximum-likelihood decoder (for the channel ([T}). 

Analysis of error probability: 

We analyze the average error probability over the ensemble 
of codebooks described above, i.e., we assume the generating 
matrix G is random, and average over all possible realizations 
of G. 

Assume that the message vectors {wi,W2} were chosen 
by users 1 and 2 respectively, such that codeword Xi was 
transmitted by user 1, and X2 by user 2. We first analyze 
the pairwise error probability, i.e., the probability of the de- 
coder preferring a (different) specific pair of message vectors 
{wj, W2}, corresponding to the pair of codewords {Xj, X^}, 
over the transmitted pair. 

As we recall, due to the linear structure of the codebook, 
linear dependencies within the set of chosen and "compet- 
ing" message vectors {wi, W2, wj, W2} result in statistical 
dependencies within the set of transmitted and "competing" 
codewords {Xi, X2, Xj, X2}. We are interested in the aver- 
age pairwise error probability associated with each pair of 
"competing" message vectors {wj, w^}. Thus, the average 
pairwise error probability has to be analyzed w.r.t. each one of 
the possible statistical dependencies. We develop upper bounds 
on the average pairwise error probability associated with each 
type of statistical dependency, and then invoke the union bound 
in order to establish an upper bound on E[P e ], the average 
probability of the decoder not deciding on the correct pair of 
transmitted codewords. Using this bound, an achievable rate 
region is obtained. 

Denote the pairwise error probability from the pair of 
message vectors {wi,W2} to the pair {wjjW^}, for a given 
codebook in the ensemble, by P e , pa ir, and the average pairwise 
error probability over the ensemble by E [P e , pa ir]- 

We begin by deriving a general expression that upper 
bounds the average pairwise error probability E [P e . pa ii-] and 
then evaluate it for each of the possible statistical dependencies 
within the set {Xi, X2, Xj, X2}. 

The decoder makes an error to the pair {Xj, X2} only if 

n n 

J2 ( % - Mh 2)]* ) 2 >£([ r '- 2)]* ) 2 , (13) 
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where is defined in (fTTT i. The condition in ( fT3l ) is 

equivalent to 

n n 

E ( Z *) 2 > E + 2) - *t(I, 2)]*) 2 . (14) 
t=i t=i 

Define the pairwise difference random variable 

U t = [*t(l,2)-* t (l,2)]* 

= [X 1>t + 7 X 2 , t - X U - 1 X %t ] * . (15) 

and vector 

U = [UxU 2 ... U n ] . (16) 

Note that the distribution of the pairwise difference vector 
U encapsulates the statistical dependencies in the set of 
codewords {Xi, X 2) Xj, Xj}. We first express our upper 
bounds on the average pairwise error probability as a function 
of the random vector U, and only then account for the fact that 
the statistics of U vary with the different types of statistical 
dependencies between the transmitted and the "competing" 
pairs of codewords. 

Substituting (TT~5T > into (fl4l . we have that an error occurs 
only if 



E(^) 2 >E(t^ + ^ 



(17) 



Given a specific codebook from the ensemble was chosen, U 
is deterministic, and ( TT7| > implies 



Pp 



Pr nz*ir > [Z 



= Pr 



> min II Z* 



(18) 



Let T" = { — 1,0, 1}". Since every coordinate of the vectors 
u and z* has an absolute value smaller than L/2, the value of 
v G LZ n that minimizes the expression ||z* + u + v|| 2 cannot 
have an absolute value greater than L in any component, and it 
suffices to limit the search for it to LT n . Hence ( fT8l simplifies 
to 



Pp 



Pr [ ||Z*|r > min ||Z* 

veLT" 



(19) 



We now state a simple lemma that enables us to replace the 
folded Gaussian noise Z* in (T% with a simple Gaussian noise 
Z. 

Lemma 1: For z G 1", u G I n and the events 



and 



> min ||z* 

veLT™ 



> min llz + u - 

v6LT™ 



(20) 



(21) 



the following relation holds 
Proof: See Appendix U 



The next lemma provides an upper bound on E [P eiPa ir], 
the average pairwise error probability over the ensemble, that 
depends only on E [cxp {-^C/ 2 }] ■ 

Lemma 2: The average pairwise error probability over the 
ensemble is upper bounded by 



E[P e , pair ] <n n , 



where 

Q = E 



exp 



SNR, 



-U' 



2 exp 



3SNR1 



(22) 



Proof: Using Lemma [T] we have 



P e , P air = Pr ||ZT > min ||Z* 



u + v 



< Pr ( l|Z|| 2 > min ||Z + u + v|| 2 

v£LT n 



IZII 2 > ||Z + u + v|| 2 



= Pr( [J 

\v6_LT" 

Using the union bound, d23T l can be further bounded by 

Pe,pair< E Pr (ll Z H 2 ^ II Z + U + V|| 2 ) 
veLT™ 

= E Prf-(u + v) T Z> ~||u + v| 



(23) 



veil" 



(24) 



Since Z is a vector of i.i.d. Gaussian components with zero 
mean and variance 1/SNR, the random variable — (u+v) T Z is 
Gaussian with zero mean and variance ||u + v|| 2 /SNR. Using 
the notation 

and recalling that 



Q(t) < exp<; --T 



d24l i becomes 



n ( VSNR, 

Pe.pair < Q \ ~ I|U V 



veLT" \ 



< e ex pf 

veLT" 



SNR 



u + v|| 2 



In order to find the average (over the ensemble) pairwise 
error probability, we need to average P e ,pair according to the 
distribution of U 



E [P e , pair ] < E E 



veLT" 



exp 



SNR 



U+v-H 2 



(25) 



Since the code generation is memoryless, U is also memory- 
less, and ( 1251 ) can be rewritten as 



i[P e . 



< 



e n E 

■veLT" t=l 



exp 



SNR 



(Ut+vt) 



(26) 



s 



Equation (|26l ) can be further simplified by replacing the order 
between the sum and the product 



E[p e , pair ] < n e 

t=l veLT 



(U t + if 



+ E 




(27) 



exp <, —U 



< (e 


exp< 


[-74 


+ 2 exp < 


f SNR L 2 1 \ ™ 

r« 4 1) (2S) 


= ME 


exp< 




+ 2 exp < 


f 3SNR|y i 



= n n 

where d2~7T i follows from the fact that the random variables 
{Ut} t _ 1 are identically distributed, and ( |28] i is true since 
\U\ < L/2, and thus (U + L) 2 > L 2 /A as well as (U - L) 2 > 
L 2 /4. ■ 

In order to obtain an explicit upper bound on the average 
(over the ensemble) pairwise error probability, we are left 
with the task of calculating f2, or equivalently calculating 

E[exp{-^t/ 2 }]. 

We recall that U is a deterministic function of the pair of 
transmitted codewords {Xi, X2} and the pair of "competing" 
codewords {XjjX^}, where each one of the codewords 
is generated as specified in (TTOb . The statistical dependen- 
cies within the set of codewords {Xx, X2, Xj, X<j} corre- 
spond to the linear dependencies within the set of message 
vectors {wi, W2, wj, w^}. Since we assumed the message 
vectors {wi,W2} are linearly independent J| there are only 
four possible cases of linear dependencies within the set 
{wi, w 2 , wj, w 2 }: 

Case A: The four vectors {wi, W2, wj, W2} are linearly 
independent. 

Case B: The vectors {wi, w 2 , wj} are linearly independent 
and W2 is a linear combination of them. 

Case C: The vectors {wi, W2, w^} are linearly independent 
and w; is a linear combination of them. 

Case D: The vectors {wi, w 2 } are linearly independent and 
both w; and are linear combination of them. 

Each case of statistical dependencies induces a dif- 
ferent distribution on U. Thus for the calculation of 
E [exp { — ^pf 2 }] > each case should be considered sepa- 
rately. To that end, we now give upper bounds on £1 for the 
four different possible cases. The derivations of these bounds 
are given in Appendix [Hi 

Case A: The codewords {Xi, X2, Xj, X5} are all statisti- 
cally independent. Given {wi,W2}, there are less than 2 2nR 
pairs of competing message vectors {wj,w 2 } that incur this 

5 The message vectors {wjjW^} are also linearly independent, as the 
decoder only searches over the pairs of linearly independent message vectors. 



kind of statistical dependency. Denote by £7 a the value of fl 
associated with case A. We have 



1 



27T/3 1 _3SNRl2, >, 3SNR 

e ^ {pn) + 2e~— , (29) 



where 



SNR p 

5(p,7) 



min I 

zez p \{o} 



(30) 



Case B: The codewords {Xi,X2,Xj} are statistically 
independent and 

X2 = [0X1 + 6X 2 + cXj]* , 

where a, b, and c can take any value in Z p . Given {wi,w 2 }, 
there are no more than p 3 2 nR pairs of competing message 
vectors {wj , w^} that incur this kind of statistical dependency. 
Denote by £Ib the value of £1 associated with case B. We have 



fin < - 




2tt/3 



3 SNR 

2e-—, (31) 



p y <5 2 0, 7 )SNR 

where S(p, 7) is as in d30b . 

Case C: The codewords {Xi,X 2 ,X2} are statistically 
independent and 

Xj = [0X1 + 6X 2 + cX 5 ]* , 

where a, b, and c can take any value in Z p . Given {wi,W2}, 
there are no more than p 3 2 nR pairs of competing message 
vectors {wj , wj} that incur this kind of statistical dependency. 
Denote by Qc the value of fl associated with case C. We have 



O c < - + 



2tt/3 



„ 3SNR 

2e"— , (32) 



<5 2 (p, 7 )SNR 

where S(p, 7) is as in (f30b . Note that although the bounds OTT ) 
and (l32t are identical, cases B and C are not identical (i.e., 
there is no symmetry) since the two codewords Xj and X 2 
play a different role in the pairwise difference vector U, as 
X§ is multiplied by 7 while Xj is not. 

Case D: The codewords {X!,X 2 } are statistically indepen- 
dent, whereas 



and 



Xj = [ a X,+6X 2 r 



X2 = [cXi +dX 2 Y 



where a, b, c and d can take any value in Z p , except for 
a = 1,6 = 0, c = 0,d = 1 (in which case Xj = Xi and 
Xg = X2). Given {wi,W2}, there are no more than p 4 pairs 
of competing message vectors {wj, w^} that incur this kind of 
statistical dependency. Denote by £1^ the value of £1 associated 
with case D. We have 



£1d < max < — 
.P 




2tt/3 



5 2 (p, 7 )SNR 



2e" 



p-l 



1 

— e 

P 



3SNR 



2e" 



(33) 
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where S(p, 7) is given in (13 Ot . 

We can now establish the theorem. Denote by 



] , i = A,B,C,D, the average error probability 
associated with each case of statistical dependencies. We 
recall that the decoder in our scheme only searches over 
the pairs of codewords corresponding to message vectors 
{\Vi,Wj} that are linearly independent. Thus, an error event 
occurs if the message vectors {wi,W2} chosen by the users 
are linearly dependant. Denote by P b ,e the probability of 
this event (which is independent of the codebook). By basic 
combinatorics 

Pe,E = (p + 1) • 2~ nR - p ■ 2- 2nR < 2p ■ 2~ nR . 

Using the union bound, the average error probability over the 
ensemble can be upper bounded by 

E [P e ] < 2 2nR ■ E[P e , pair , A ] + p 3 ■ 2 nR ■ E[P e , pair , B ] 



? -[P e ,pw,D] + Pe,E 



+ p 3 -2 ni *-E[P e , pair>c ]+p 4 

< 2 2n(-R+5logn A ) 

, 2 ™(-R+3 l2 a- E +logO B ) 
, 2 "(-R+3 l£ | £ +iogO c ) 

^(li^E+logOo) 



Holding p constant and taking n to infinity we see that the 
average error probability goes to zero if 



R < - -logfl A , 
R < - \ogfl B , 

R<-\o g n c , 
>iogr2_o. 



(34) 

(35) 
(36) 
(37) 



The conditions d34t . ( f35l > and (l36T l imply that the rate should 
be taken to satisfy 



R < min 




2tt/3 
SNR 



1 3SNR r2 



2e" 



2tt/3 



<5 2 (p,7)SNR 
whereas condition ( |37| i implies 




2tt/3 



<5 2 (p,7)SNR 



2e 



3 SNR 

2e b- 



< 1, 



and 



p-1 



1 - 
— e 
P 



3SNR 

2e — s- < 1. 



(38) 



(39) 



(40) 



Condition (|39l is satisfied for any positive rate, since it is 
contained in d38l . Condition ( f39l > is equivalent to 



< 1 - 2pe 



3SNR 
8 



Since any prime value of p that satisfies ( |4TT > is valid, we 
can maximize (f38l > over all prime values of p satisfying ( |4T1 ). 
i.e. over all values in V^y) as defined in (0, which yields 
(O. Finally, since there must be at least one codebook in the 
ensemble with a smaller (or equal) error probability than the 
average over the ensemble, the theorem is proved. 



IV. Application to interference alignment 

In the previous section we found an achievable symmetric 
rate for the Gaussian modulo-additive MAC channel where 
both users use the same linear codebook. The motivation 
for developing such a coding scheme is to enable lattice 
interference alignment. 

Assume a receiver observes a linear combination of code- 
words transmitted by several users (corrupted by noise) and is 
interested in decoding only one of the codewords, namely the 
received signal is 



y = h\x\ 



K 

■E 

fc=2 



where {hk}^ =1 are the channel gains, xi is the desired 
codeword, {^k}^ =2 srs the interfering codewords, and z is 
a vector of i.i.d. Gaussian noise. 

One approach is to treat all the interfering codewords as 
noise. This approach would not be effective when the total 
power of the interference is on the order of that of the desired 
codeword (or stronger). Another possible approach would be 
trying to decode all the codewords {x^}, thus treating 
the channel as a MAC with K users. It is well-known (see 
for example Ifl9l ) that at high SNR, the achievable rates as 
dictated by the capacity region of the (Gaussian) MAC channel 
are essentially a "zero-sum" game (up to a power gain), i.e., 
time sharing is nearly optimal. In particular, for such a channel, 
if all users are working at the same rate, the symmetric rate 
scales like 

^logSNR. 

Since the decoder is only interested in one of the codewords, it 
seems wasteful to decode all of the interferers as well. For this 
reason, it is desirable to align all interferers to one codeword, 
as was first noticed in f\3\ . After alignment is performed, 
the receiver only has to decode two codewords: the desired 
codeword xi, and the aligned interference codeword. 

A linear code, as defined in (O, facilitates the task of 
aligning the K — 1 interfering codewords into one codeword. 
Specifically, if all interfering codewords {x/ c }^ 2 are taken 
from the same linear code C, and the channel gains {/i/s}fL 2 
associated with the interfering codewords are all integers, we 
have 



(41) 



K 



,fc=2 



Xif G C, 
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and therefore the received vector can be reduced modulo the 
interval 2" to yield 



(42) 







' K 


* 


y* = 


hiXi + 




+ z 






,k=2 





[hlXl + XIF + z]* 



Since xi and xif are both members of the same linear code- 
book, the equivalent channel in d42l satisfies the conditions of 
Theorem Q] and we can find an achievable symmetric rate for 
it. 

At this point it is worth noting the advantage of joint 
decoding over successive decoding. A successive decoding 
procedure, as used in lfl4l and |[T6l . can decode both code- 
words only if a very strong interference condition is satisfied; 
that is, one of the codewords can be treated as noise while 
decoding the other codeword. For a wide range of values 
of hi, successive decoding does not allow for for positive 
transmission rates. The result of the previous section provides 
an achievable rate region that is greater than zero for a much 
wider range of values of hi. 

We next give a formal definition for the Gaussian inter- 
ference channel, and then use the results of the previous 
section in order to derive achievable rates for certain classes 
of interference channels. 

A. The K-user Gaussian interference channel 

The K-user Gaussian interference channel consists of K 
pairs of transmitters and receivers, where each transmitter k 
tries to convey one message Wk out of a set of 2 nRk possible 
messages to its corresponding receiver. Specifically, the signal 
observed by receiver j is 

K 

Y . = h 33 X 3 + ^2 h i kXk + Z h 

where hjk is the channel gain from transmitter k to receiver j, 
and Zj is the Gaussian noise present at receiver j. All trans- 
mitters and receivers have perfect knowledge of all channel 
gains. We assume that the Gaussian noise at each receiver is 
i.i.d. with zero mean and variance 1 / SNR and that the noises 
at different receivers are statistically independent. We assume 
all transmitters are subject to the same power constraint 

-e[KII 2 ] < 1- 

n 

Each transmitter k has an encoding function 

fk : {l, . . • , 2 nRk } — > 1", 

such that the signal transmitted by user k during n channel 
uses is 

Xfc = fk(w k )- 

Receiver j recovers the message using a decoding function 

g,j : K™ -> {l,...,2 nR *} . 

Let 

=9 Ay j) 



be the estimate receiver j produces for the message transmit- 
ted by transmitter j. We define the error probability as the 
probability that at least one of the receivers did not decode its 
intended message correctly 

P et w = E [Pr ({wx, w K } ^ {ttfi, • • • , %})] , 

where the expectation here is over a uniform distribution on 
the messages. 

We say that a rate-tuple {Ri, . . . , Rk} is achievable if there 
exists a set of encoding and decoding functions such that P e ,iF 
vanishes as n goes to infinity, and that a symmetric rate i? sym 
is achievable if the rate-tuple {i? sym , . . . , i? S ym} is achievable. 



B. The integer-interference channel 

We restrict attention to a special family of 1^-user Gaus- 
sian interference channels which we refer to as the integer- 
interference channel. In this family, all the channel gains 
corresponding to interferers are integers, i.e., for all j 7^ k, 



ijk 



a,jk G Z. 



The following theorem establishes an achievable symmetric 
rate for the integer-interference channel. 

Theorem 2: For the X-user integer-interference channel, 
the following symmetric rate is achievable 



ym 



< max mm 

P enf =1 T"(^)ie{i 



1 . 




(43) 



where 5(-, •) is defined in ©, and V(-) is defined in (0. 

Proof: We begin by recalling the encoding and decoding 
procedures. 

Encoding: Generate a linear ensemble of codebooks of rate 
Rsym over 1 p as described in Section UTI] where p is taken as 
the maximizing value in j43l . Choose the codebook C which 
achieves the smallest average error probability, P e ,if- Each user 
encodes its message using this codebook. 

Decoding: Each receiver first reduces its observation mod- 
ulo the interval 1. The equivalent channel receiver j sees is 
therefore 



K 

k=\,k=^3 



a., fe x fe + z 7 



where 



X 1FJ 



xif, 



K 



fc=i,A#y 



The linearity of the codebook implies that xip.j G C. 
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From Theorem[TJwe know that Xj and xifj can be decoded 
reliably (by receiver j) as long as the symmetric rate i? sym 
satisfies 



(a) 



-Rsym < -77 log -r 



SNR p 



-e 2 p' 



(6) 

-Rsym < ~ log [ - 



and 



/ 2tt/3 
p ' V <J 2 (p, ft^SNR 



p G 



^ 2 (/ lW)P ) + 2e _3 fs 
(44)' 

(45) 

(46) 
for 



3SNR 

2e — ~ 



The codebook C satisfies conditions d44l i. d45l l and 
everjQ 1 < j < A", and thus the theorem is proved. 



C. Integer-interference channel: Degrees of freedom 

Theorem [2] provides an achievable symmetric rate for the 
integer-interference channel that is valid for any SNR. We now 
show that in the limit where the SNR goes to infinity, the 
coding scheme achieves K/2 degrees of freedom, which is 
the upper bound established in [8|. This shows that for the 
integer-interference channel, the proposed scheme is optimal 
in a DoF sense, and thus recovers the asymptotic results of [5|. 

In order to find the number of DoF the scheme achieves, 
we need the following theorem from the field of Diophantine 
approximations, which is due to Khinchin. 

Theorem 3 (Khinchin): For almost every 7 G R, the num- 
ber of solutions to the inequality 

|7~f|<*(0 
for a G Z and Z G N, is finite if the series 



X>(0 



converges, and infinite if it diverges. 

Proof: See, e.g., (20|. ■ 

Setting $(Z) = Z~ 2 ~ £l in Theorem [3] it follows that for 
any ei > 0, and almost every 7 6 M, there exist an integer 
i*(7, ei) for which there are no solutions to the inequality 

L*7l 



I 



7- 



I 



< I 



-l-ci 



(47) 



in the range I > l*("f, ei). Moreover, for such 7 there exist a 
constant c\ > for which 

L'7l 



z ■ 



7- 



z 



>c 1; 



(48) 



for every I < l*("y,ex). Combining (|47l i with j48l , it follows 
that for almost every 7 e K there exist a positive integer 

6 The existence of a codebook C that is simultaneously good for all K 
equivalent MAC channels is guaranteed for any finite number of users K. 



Z*(7, ei), such that if p > Z*(7, ei) there are no solutions to 
the inequality 

L*7l 



7- 



-l-ei 



in the range Z G Z p \{0}. Thus, for any ei > and p large 
enough we have 



<*(p.7) > P 



-l-ei 



(49) 



for almost every 7 G R. 

We now show that the ratio between the symmetric rate 
as given by Theorem [2] and !/4logSNR approaches 1 for 
almost every set of direct channel gains when the SNR tends 
to infinity, and hence the number of DoF is K/2. 

Before giving a formal definition to the number of degrees 
of freedom, we need a few preliminary definitions. We define 
an interference channel code C as a set of encoders {fk}^ = i 
and decoders {gt}^ = i- We define an interference channel 
coding scheme as a family of interference channel codes 
{C'(SNR)}, and define 72.' (SNR) as the set of all rate-tuples 
that are achievable for the interference channel code C'(SNR). 

Definition 1: An interference channel coding scheme 
{C'(SNR)} is said to achieve d degrees of freedom if 



EfcLi R k 



lim sup max 

SNR^-oo Ri,-,RkEK'(SNR) ± log SNR 



= d. 



In order to show that for asymptotic (high) SNR, the 
ratio between the symmetric rate i? sym given by (03]) and 
1/4 log SNR approaches 1, we set p = SNR 1 / 4-62 (where 
€2 > is chosen such that p is a prime number). From (l49l . 
we see that for this choice, for high enough SNR and almost 
every 7 G K we have 

<5(p, 7 ) > SNR- 1/4+ei£2+£2 - £l/4 . (50) 

We now use (l50l in order to find lower bounds on the maximal 
achievable symmetric rate of Theorem |2] for asymptotic SNR 
conditions. 

The argument of the logarithm in (l44t can be upper bounded 
(after some straightforward algebra) by 



1 /2tt/3 t l^-Wj^p,^.) 



p* V SNR p 

<SNR- 1 / 2+2e2 + \/ySNR 1 :> 



3 SNR 

2e — s" 




6 2 (p,hjj)SNR 



l/4+e2 p -%SNR' L '2+ 2 'i'2-'^ 2 , 2 g- 

1 by 

3SNR 

2e — s- 



+ 2SNR ->---e 
and the argument of the logarithm in 

I 2?r/3 



(51) 



< SNR- 4 / 4+£2 + JflsNR- 1 /^-^^/ 4 



Taking e\ and £2 to zero such that 

4e 2 + 2eie 2 - e x /2 > 0, 



2e" 



(52) 
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and taking SNR to infinity we see that the r.h.s. of d5TT i 
and d52l are approximately SNR -1 / 2 and SNR -1 / 4 respec- 
tively. Since (|5TT i and d52l hold for almost every hjj € M, 
they simultaneously hold for almost every set of direct channel 
gains {hjj}^ =1 as well. We thus conclude that the ratio 
between the symmetric rate from Theorem |2] and !/4logSNR 
approaches 1 when the SNR tends to infinity, and therefore the 
number of DoF is A/2 for almost every set of direct channel 
gains. 

D. Example 

Consider the 5-user integer-interference channel where the 
channel gains are the entries of the matrix 



/ 


h 


1 


2 


3 


4 \ 




5 


h 


3 


6 


7 




2 


11 


h 


1 


3 




3 


7 


6 


h 


9 


V 


11 


2 


6 


4 


h ) 



For this channel, we plot the achievable sum rate of our 
scheme (which is 5 times the symmetric rate i? S ymX an d for 
reference we also plot the sum rate a time sharing scheme 
would have achieved. One more curve we plot for reference 
is the curve 

|-ilog(l + (l+fc 2 )SNR), (54) 

which corresponds to the sum rate that could have been 
achieved if the symmetric rate for a two-user Gaussian MAC 
with one linear code was the same as that of the same channel 
with two random codes, in other words, if r norm given in (|9]l 
were 1. In the absence of explicit upper bounds for the K- 
user interference channel with finite SNR, (l54l serves as a 
reasonable benchmark to the best performance one can expect 
to achieve, which is based on the known fact that the number 
of DoF the channel offers is A/2. 

We consider two different values of h: h = 0.707, and 
h = V2/20 The results are shown in Figure [3] In Figure |4] 
we plot the same curves for h = 0.24 and h = v7 /ll. 

These examples show the advantages of interference align- 
ment over time sharing for high enough SNR. For a larger 
number of users, interference alignment is preferable over time 
sharing for lower values of SNR. We note that in this paper we 
have not considered further optimizations for the achievable 
rate, such as combining it with time sharing of powers, or 
superposition (layered coding schemes), which may result in 
higher gains. Time sharing of powers, i.e. having all users 
transmit with more power for some of the time and remain 
silent for the rest of the time is important at low SNR values, 
since our coding scheme only achieves positive rates above 
some SNR threshold. 

An important insight from Figures |3] and |4]is the sensitivity 
of interference alignment to the channel gain h. Even though 
in each figure we have used values of h that are very close, 
the performance of the scheme differs significantly when the 
SNR is very high. This sensitivity is the subject of the next 
section. 

7 Our coding scheme would have the same performance for h + m, m £ Z 
as well; however the reference curves do change when adding integers to h. 
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Fig. 3. The achievable sum rate for the integer-interference channel (53) for 
h = 0.707 and h = %/2/2. 
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Fig. 4. The achievable sum rate for the integer-interference channel (53) for 
h = 0.24 and h = V7/11. 



V. Quantifying rationality 

Previous results regarding interference alignment for the 
time-invariant A"-user Gaussian interference channel were 
mainly focused on the degrees of freedom of the channel. 
In it is proved that the number of DoF of a A'-user 
integer-interference channel are K/2 when the direct channel 
gains are irrational and is strictly smaller than K/2 where the 
channel gains are rational. This result is also supported by the 
results of [6| where it is shown that the degrees of freedom 
of a A-user (not necessarily integer) interference channel is 
A/2, unless there exist some rational connections between the 
channel coefficients. 

Clearly, from an engineering perspective, the dependence 
of the DoF on whether the direct channel gains are rational 
or irrational is very displeasing. It is therefore important to 
understand the effect of the channel gains being rational at 
finite SNR. 

Theorem [2] sheds light on this matter. Specifically, it quan- 
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tifLes the loss associated with a rational direct channel gain 
hjj in the symmetric rate as a function of how large the 
denominator of hjj is w.r.t. the SNR. Specifically, if hjj is 
a rational number of the form hjj = r/q, the symmetric 
rate achieved by the coding scheme can never exceed \ogq. 
This is evident from the presence of the factor 5(p,hjj) in 
((43}. For any p > q we have S(p,r/q) = 0. Thus, in 
order to get positive rates we must choose p < q and the 
symmetric rate would be smaller than log q for any SNR. For 
this reason, a small denominator q limits the symmetric rate 
even for low values of SNR. However, a large value of q 
limits performance only at high SNR. This phenomenon can be 
seen in Figure [3] where at a certain SNR point, the symmetric 
rate corresponding to h = 0.707 (which is a rational number) 
saturates. In Figure [4] this effect is even more pronounced for 
h = 0.24 as the denominator of h in this case is q = 25, 
rather than q = 1000 which is the case for h — 0.707. It is 
also seen from Figures[3]and|4]that for low values of SNR, the 
symmetric rates corresponding to the irrational values of h and 
their quantized rational versions are nearly indistinguishable. 

Another question that arises from the results of [5 1 and is 
how the rate behaves when the direct gains approach a rational 
number. Theorem [2] provides an answer to this question as 
well. If hjj — r/q is small, then 8(hjj,p) would also be small 
for p > q, which would result in an effectively lower SNR. 
However, the function S(p,hjj) is continuous in the second 
variable, and thus, letting hjj approach r/q (where q < p) 
results in a continuous decrease of the effective SNR. 

VI. Non-integer interference channels 

We have seen that for the integer-interference channel, 
the result of Theorem [TJ was very useful for finding a new 
achievable rate region. The requirement that at each receiver 
all the channel gains corresponding to interferers are integers 
is necessary because the codebook we use is only closed under 
addition of integer-valued multiplications of codewords, which 
allows to align all interfering signals into one codeword. 

Unfortunately, the integer-interference channel model does 
not capture the essence of the physical wireless medium. 
Under realistic statistical models for the interference channel, 
the probability of getting an integer-interference channel is 
clearly zero. 

It is thus desirable to transform the original interference 
channel into an integer-interference channel by applying cer- 
tain operations at the transmitters and the receiversJ3 Specifi- 
cally, it is necessary that at each receiver the ratios between all 
interference gains be rational, and then an appropriate scaling 
at each receiver can transform the channel into an integer- 
interference channel. 

Assume that each receiver had scaled its observation such 
that one of the interference gains equals 1, We would like to 
"shape" the other interfering gains seen at each receiver to be 
integers as well, using operations at the transmitters. It turns 
out that by using power back-off at each transmitter, i.e., each 
transmitter k scales its codeword by a factor of ctk < 1 prior 

8 We do not discuss the possibility of adding more antennas at the receivers 
or the transmitters which could also assist in the problem. 



to transmission, it is possible to transform only K — 1 (in 
addition to the channel gains that were equalized to 1 by the 
receivers) of the total K 2 channel gains into integers. It follows 
that perfect alignment, i.e., alignment of all interferers at all 
receivers simultaneously, is not possible (by these methods) 
even for K — 3 users. 

One solution to this problem is performing partial align- 
ment, as described in J6), which is suitable for almost every 
set of channel gains. This method roughly transforms the 
channel seen by each receiver into a MAC with a large 
number of inputs, where about half of the inputs correspond 
to the information transmitted by the desired transmitter, and 
the other half to interferences. At asymptotic (high) SNR 
conditions, it was shown in [6| that this approach achieves 
K/2 degrees of freedom, i.e., each receiver is capable of 
decoding all the inputs corresponding to its intended messages 
in the presence of the interfering inputs. 

An extension of the partial interference alignment approach 
proposed in (6) to the non-asymptotic SNR regime, in the 
same manner we extended the results of [5| for the integer- 
interference channel to non-asymptotic SNR, would require 
an extension of Theorem [TJ to more than two users. Finding 
a non-trivial achievable symmetric rate for a Gaussian MAC 
with a general number of users, where all users are forced to 
use the same linear code, appears to be a rather difficult task. 
Moreover, the symmetric rate found in Theorem [TJ depends 
on the ratio between the MAC channel coefficients. This 
dependence is expressed in our results through the factor 
S(p, 7), which effectively reduces the SNR when it is small. 
The results for the two-user case suggest that for a larger 
number of users, the SNR loss caused by the ratio between the 
coefficients would significantly increase. If this is indeed true, 
partial interference alignment would not result in significant 
gains over time sharing, for the moderate to high SNR regime. 

A different approach that is yet to be exhausted for the time- 
invariant interference channel, is using the time dimension in 
conjunction with power back-off in order to achieve alignment, 
and allow for joint decoding of the intended message and some 
function of the interferers at each receiver. 

Such an approach can be thought of as the interference 
channel's dual of space-time codes, and we refer to it as 
"power-time" codes. An example of such a power-time code 
is given in Appendix [Till The power-time code given in the 
example is suitable for the 3-user interference channel (with 
arbitrary channel coefficients), and allows to achieve 9/8 
degrees of freedom for almost all channel gains. While it is 
already known from [6 1 that the number of DoF offered by this 
channel is 3/2, our "power-time" approach gives an explicit 
expression for the symmetric rate for finite SNR. 

A third possible approach for general interference chan- 
nels, is inspired by the recently proposed relaying strategy 
Compute-and-Forward [11 1. Nazer and Gastpar have shown 
in ifTTl that when using nested lattice codes (see e.g. [21 1), 
it is not necessary to completely align all users. In IfTTl . a 
receiver sees a linear combination of signals transmitted by 
different users, and tries to decode an integer-valued linear 
combination of these signals. In order to do that, the decoder 
multiplies its observation by a scaling factor which plays the 



14 



role of directing the channel gains towards the integer-valued 
coefficients of the linear combination it tries to decode. Since 
the channel gains usually do not have rational connections 
between them, there would always be some residual error in 
the approximation of that integer-valued linear combination. 
By using a nested lattice scheme, this residual error can be 
made statistically independent of the transmitted signals, with 
a distribution that approaches that of an AWGN when the 
lattice dimension tends to infinity. 

A similar technique can be applied to the interference 
channel as well. Each receiver will try to scale the gains of the 
interferers it sees towards some integer- valued vector, and then 
jointly decode this integer-valued linear combination, which 
is a point in the original lattice (which all transmitters have 
used for transmission), along with the desired lattice point 
transmitted by the user which tries to communicate with that 
receiver. The advantage of this approach is that it does not 
require perfect alignment. However, the residual noise caused 
by this imperfect alignment incurs some losses in the achieved 
rate, and will most likely accumulate when K grows. 

VII. Concluding remarks 

We studied the two-user Gaussian MAC where all users 
are restricted to use the same linear code, and derived a new 
coding theorem which provides an achievable symmetric rate 
for it. Some of the bounds in the derivation of the coding 
theorem are rather crude, and as a result our analysis may 
not be very tight. Deriving a tighter analysis would be an 
interesting avenue for future work, and of particular interest 
is an extension of our coding theorem to to the case where 
both users transmit codewords from the same nested lattice 
codebook, with a "good" coarse lattice, as opposed to the one 
dimensional lattice we used. Nevertheless, we believe that the 
derived expression for the symmetric rate captures the essence 
of the problem of restricting both users to transmit codewords 
from the same linear code. The new coding theorem was 
utilized for establishing a new achievable rate region for the 
integer-interference channel which is valid for any value of 
SNR. For a wide range of channel parameters the new rate 
region is the best known, and in particular, in the limit of 
SNR approaching infinity it coincides with previously known 
asymptotic results. We discussed strategies which enable to 
apply our results to the general (non-integer) K-user interfer- 
ence channel, among which is the novel "Power- Time" codes 
approach. 
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Appendix I 
Proof of LemmaQ] 



Proof: Let 



v' = arg min ||z + u + v|| 2 



and denote by S the set of indices for which the absolute value 
of v' is not greater than L, and by S the set of indices for 
which it is greater than L, namely 

S = {s:\v'(s)\<L} 
S = {s:\v'(s)\>L}. 

Let us, vg, v$, zg, z* s be sub-vectors of u, v, v, z, z* in the 
indices S, and u^jV^jV^jZ^jZ^ be their sub- vectors in the 
indices S. It suffices to show the next two inequalities: 



l z s| 



— mm 



and 



z<? — mm 

vsGLTISI 



mm 



mm 



Zs + U S 

\z* s +u s +v s 
|I Z 5 + U S + V 6 



vsir > 

2 



(55) 



> 



(56) 



The first inequality 
we have 



mm z s 



and since z 2 > (z*) 2 . 
In order to see that 



is true since for every index s G S 



min (z* + u s + v s ) , 

v 3 ELT 



, is true we note that for any index 



s (and in particular s G S) we have 

{z* s ) 2 - min {z* s + u s + v s ) 2 < 



On the other hand, for any s € S, the inequality \z s \ > L 
must hold, since otherwise the value of v s <G LZ minimizing 
(z g + u s + v s ) 2 cannot be greater than L. Moreover, since 
\u s \ < L/2, it follows that 

/ r\ 2 



min (zg + ug + vg) < \z s \ - — 

isGLT \ 2 

Finally, for every s E S we can write 



(z s ) - min (z s + u s + v s ) > (zg) z 

v s GLT 



Zg\ 



2\zg 



> 



*\2 



min (zg + u s + v s ) , 



> (Z* S ) 



which establishes 



Appendix II 

Derivation of the Error Probabilities for the 
Different Cases 

We begin this section with four lemmas that are repeatedly 
used in the error probability derivations for the different cases 
of statistical dependencies. The proofs are rather cumbersome 
and are given in Appendix III-AI 

Lemma 3: Let X be a random variable uniformly dis- 



tributed over 



-Z„ 



and let 8 be some random variable 
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statistically independent of X. Define the random variable U 
by 

U=[X + &]*. 
The following inequality holds: 

1 /2vr/3 



E 



< 



P 



SNR 



Lemma 4: Let X be a random variable uniformly dis- 



tributed over 
tically independent 



P p 



and be some random variable statis- 
of X. Define 



U=[(f + e)X + e}*, 
where / is a constant in Z p , and eel. For any /eZ p 

1 



E 



_ SNR Jj2 

e s 



< 



/ 2tt/3 
p ' V <5 2 0,£)SNR' 



where 



(5(», e) = min ^ • 
;gz p \{o} 



I 



(57) 



Lemma 5: Let Xx, X2 and X3 be three i.i.d random vari- 



ables uniformly distributed over 
arbitrary constant. Define 



p P 



and 7 € K some 
U =[X 1 + 1 (X 2 -X 3 )}* . (58) 



Then 



E 



e s 



< 



1 

p 2 



W 3 , I "ff 5 2 (P,7) 

SNR p 



where #(•,•) is defined in (T5Tb . 

Lemma 6: Let X be a random variable uniformly dis- 



tributed over 
Define 



, and e € K some arbitrary constant. 

(59) 



[/= [/JC + eX-e[rX]*]* 
where / e Z p \{0} and r E Z p \{0, 1}. Then 



E 



e s 



p- 1 1 
< l - + -e 



3 SNR 



P 



P 



We also state two properties that are extensively used (some- 
times implicitly) throughout the calculations in this section: 

Property 1: For any IgR and a € Z 

[a [X]*]* = [aX]*. 

Property 2: For any a > and /3 > 

mod [-if) = a \ X \ mod [-&,&) ■ 

We now turn to the analysis of the different cases of 
statistical dependencies. 



Case A 

In this case we have 

U A = [X 1 + 1 X 2 -X 1 - 1 X 2 }*, 

where Xx, X2, X\ and X 2 are four i.i.d. random variables 



. Let 



uniformly distributed over 

X x = [Xx-Xtf, 

and note that X\ is uniformly distributed over 
is statistically independent of X 2 and X 2 . We have 

U A =[X 1 +j(X 2 -X 2 )] 

Applying Lemma [5] gives 

1 



and 



E 



< 



27T/3 i 1 -M!}RS 2 (p,-f) 



(60) 



p" V SNR p 

Denote by fl A the value of fl associated with case A 
Substituting (lo*0l l into d22l we have 



Q A < 



Case B 



1 



27r/3 1 3SNR & / A 3SNR 

— L_ + _ e + 2e — ~ . 

SNR p 



(61) 



In this case 

U B = Xx+ 1 X 2 - Xx - 7 [aX x + 6X 2 + cX{\ 
X! + L7l^2 + (7- \ri\)X*-Xi 

(7- \ r i\)[aXx + bX 2 + cX l ] 
(1 - L7]a)^i + (LTl " il}b)X 2 + (-1 - [^c)Xx 
(7 - Lt| ) (X 2 - [all + bX 2 + cXx] ' 



where X\, X 2 and Xx are three i.i.d. random variables 



uniformly distributed over 



-Zp 



Define 



and 



di = [l- L7la] mo d P . 
d 2 = [L7l - h]b] modp , 
d 3 = I" 1 " L7l c] mod p , 



e = 7- L7l G [~2'5) 



Using these notations we have 



U B = [dxXx + d 2 X 2 + d z Xx\* 

+ e (X 2 - [aXx + bX 2 + cXx}*) 



(62) 
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We now show that if the vectors [di ds] and [a c] are linearly 
independent (over Z p ), the first and the second elements in 
the sum are statistically independent. To that end we prove 
the following lemma. 

Lemma 7: Let A be a full-rank deterministic matrix with 
dimensions M x M over Z p where p is a prime number, and 
X a vector with dimensions Mxl containing elements which 
are i.i.d. random variables uniformly distributed over Z p . 

Let X = AX. (with operations over Z p ). Then the elements 
of X are also i.i.d. random variables uniformly distributed over 
Z p . 

Proof: Since A is full-rank, for any vector x there exists 
one and only one vector x that satisfies x — Ax. Since the 
elements of X are i.i.d. and uniformly distributed, the vector 
X is uniformly distributed over Z^ 4 ', which implies that 



Pr (X = £) = Pr(X = x) = ( - 



for every vector x £ Z p . This in turn implies that the elements 
of X are i.i.d with uniform distribution over Z p . ■ 

Let 



(63) 



With this notation 




can be rewritten as 



U B = [Xi + e{X 2 - X 3 )]* . 

We now have to distinguish between the case where [di d 3 ] 
and [a c] are linearly independent which will be referred to 
as Case Bl, and the case where they are linearly dependent. 
The case where [di d 3 ] and [a c] are linearly dependent and 
[a c] [0 0] will be called Case B2, and the case where 
[a c] = [0 0] will be called Case S3. 

Case Bl - If [d\ d 3 } and [a c] are linearly independent, the 
matrix in (l63l is full rank, and it follows from Lemma |7] that 
{Xi, X 2 , -^3} are each uniformly distributed over 
and are statistically independent. In this case 



-Z„ 



where 



©si — z{Xi — X 3 



is statistically independent of X\. 
Applying Lemma [3] gives 



E 



< 



2tt/3 
SNR 



(64) 



Denote by SIbi the value of associated with case Bl. 
Substituting ( |64l into (|22| > we have 



n B1 < - 
p 



27T/3 3SNR 

— — +2e — —. 
SNR 



(65) 



Case B2 - We now consider the case where [d\ d^\ and [a c] 
are linearly dependent and [a c] ^[0 0]. In this case we have 

[diXi + dzXj]* = [r {aXi + cXf)]* , 



Zn 

P P 



and is 



for some r £ Z p . Let 

X 3 = [aXi+cXi]*, 

and note that X3 is uniformly distributed over 
statistically independent of X^. We can rewrite Ub2 as 

U B 2 = [rX 3 + d 2 X 2 +e(X 2 - [X 3 + bX 2 }*)] * . 

Now let 

X 4 = [X a + bX 2 }*, 

and note that X4 is statistically independent of X 2 . Using this 
notation, we have 



U B 2 = [rXi + (d 2 - rb) X 2 +e (X 2 - X4)] 
= [(/ + £) X 2 + {r-e)X A ]\ 



(66) 



where 



Let 



/ = [da - rb] 



mod p ' 



e B2 = {r-e)X i , 

and note that Qb2 is statistically independent of X 2 . Using 
Lemma |4] we have 



E 



_ SNR tt2 
g S U B2 




2tt/3 



(5 2 (p,7)SNR' 



which means that 



< 




2tt/3 



p y S 2 (p, e)SNR 
We further note that 



3 SNR 

2e — s-. 



(67) 



(68) 



S(p, 7) = min I ■ 

(ez p \{o} 

= min I ■ 

;ez P \{o} 



7 



e — 



I 

JM 
1 



= 5(p,e). 



Thus. 




(69) 



Case B3 - Since a = c = it follows that d% = 1 and 
rf 3 = [—1] mo d p = p — 1. Thus, d62l can be written as 



U B3 

Letting 

e B3 = 



X 1 + d 2 X 2 + (p - + e (X 2 - [6JT 2 ]*) 



d 2 X 2 + (p - + e (X 2 - [6X 2 ]*) 
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and using the fact that X\ is uniformly distributed over 
and is statistically independent of Lemma [3] 



^71 

P P 

can be applied, which yields 

SNR rr2 



1 

< - 

P 



2tt/3 
SNR' 



(70) 



Denote by £Ib3 the value of fl associated with case B3. 
Substituting (f70l into (l22l gives 



< - 



27T/3 3SNR 

— — + 2e — s" 
SNR 



Since S 2 (p, 7) < 1, combining d65l l with 



(71) 

and (T7TT > yields 



Hb<- 




2tt/3 



3SNR 

2e"— , (72) 



p V S 2 (p, e)SNR 
for all possible values of a, 6, and c. 

Case C 
In this case 

U = [X x + ~fX 2 ~ [aX-L + bX 2 + cX 2 }* - ~fX 2 ] * 
= [(1 - a)X 1 + (7 - b)X 2 + (-7 - c)X- 2 ]* , (73) 

where X\, X 2 and X 2 are three i.i.d. random variables 

r 1 * 

uniformly distributed over — Z p . 

We distinguish between the case where a ^ 1, which we 
refer to as Case CI, and the case where a = 1, which we 
refer to as Case C2. 

Case CI - Since « ^ 1 the random variable [(1 — cl)X{\* 
is uniformly distributed over 
random variable 



-Zp 



We further define the 



8 C1 = ( 7 -6)X 2 + (- 7 -c)X Sl 

which is statistically independent of [(1 — a)ATi]*. Applying 
Lemma [3] yields 



e s 



< 



2tt/3 
SNR' 



(74) 



Denote by CIqi the value of f2 associated with case CI. 
Substituting ( F74T > into d22l we have 



Ma < 



1 



27T/3 3SNR 

1 2e — a - . 



p V SNR 
Case C2 - Since a = 1, (1731 can be rewritten as 

U C 2 = [{ 1 -b)X 2 + (- 1 -c)X 2 \* 

= [(e + LTl - b)X 2 + (-e - L7l - c)Xg] 
= [(/ + E )X 2 + (r- E )X 2 r 



(75) 



(76) 



where 



7 - LtI , 



and 

r = [—LtI -c] modp . 

Letting 

e C2 = (r - e)X 2) 
and applying Lemma |4] gives 



^ 1 / 27r/3 _ 3SNR 

^ 2< ^ + V^7W +2e 8 (77) 

Combining (F75T > with d77l >. and using the fact that S(p, 7) < 1, 
yields 



^ 1 / 27r/3 „ 3SNR 

n -<- P ^m^m +2e ^- (78) 

for all possible values of a, b, and c. 

Case D 

In this case we have 

U D = [Xi + 7^2 - aXi -6X2-7 [cXi + ^2]*] * 

= (l-a-L7lc)X! + (L7l ~b- li]d)X 2 



e(X 2 - [cX! +dX 2 ]*) 



(79) 



where X\ and X 2 are two i.i.d. random variables uniformly 



distributed over 
letting 



and 



, and e — 7 — [7] as before. Further, 



di = [1 



[LtI 



LtIc] 



mod p ' 



3d p ' 



we have 



U D = [dxXt + d 2 X 2 +e(X 2 - [cXi + dX 2 ]*)] * . 

This is the most complicated case in terms of the number of 
different combinations of a, b, c and d we have to consider. 
Case Dl corresponds to c ^ 0, Case D2 to {c = 0, a 7^ 1}, 
Case Z)3 to {c = 0, a = 1, d = 1} and finally the case where 
{c = 0, a = 1, d > 2} will be referred to as Case Z?4. The case 
where {c = 0,a = 1, d = 0} does not have to be considered 
as in this case W2 = 0, and hence the message vectors wj 
and W2 are linearly dependent. As we recall, the decoder in 
our scheme does not consider such pairs of message vectors. 

We denote by Vloi , i = 1, • • • , 4, the value of fl associated 
with Case Di. 

Case Dl - Define 



/ = [Lt1-&] 



mod ; 



X* = IcXi 



dX 2 
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which is statistically independent of X 2 as c 7^ 0. Now 

U D1 = [diXi + d 2 X 2 + e(X 2 - X 3 )}* 

= [(dxc- 1 )^! + d 2 X 2 + e(X 2 - X 3 )] * 

= (d 1 c- l ){cX 1 +dX 2 ) 
+ (da - d 1 c- 1 d)X 2 + e(X 2 - X 3 



Since X 2 is uniformly distributed over 



-Zj, 



, d 2 G Z P \{0} 



and de Z p \{0, 1}, we can apply Lemma [6] which gives 
p-1 , 1 -« 



^_D4 < 



3SNR 

2e s". 



(86) 



(80) 



where c 1 G Z p is the inverse element of c in the field Z p . 
Let 



Combining ([82]), ([83), (EH), d86Jl and the fact that S(p,j) < 
1, we conclude that for all values of a, b, c, and d that are 
considered by the decoder (except for a — 1, 6 = 0, c = 0, d = 
1 which does not incur an error event), we have@ 



[die" 1 ] 



fin < 



d < max < - 



mod ; 



1 



2tt/3 



and 



f=[d 2 - d 1 c~ 1 d\ 



mod p 

With these notations, (fSOb can be written as 

L/bi = [rl 3 + fX 2 + e(X 2 - X 3 )]* 
= [(f + e)X 2 + (r - e)X 3 }* . 

Letting 

Qdi = (r-e)X 3 , 
and applying Lemma [4] gives 



P V <5 2 b,7)SNR 



p-1 1 
+ -t 

V V 



2e 



3SNR 

2e — s- 



(87) 



(81) 



fioi < - 



2vr/3 



5 2 (p,7)SNR 



3SNR 

2e — 5-. 



(82) 



Case D2 - Since c = 0, (1791 1 becomes 

(1 - a)Xi + d 2 X 2 + e(X 2 - [dX a ] 



A. Proofs of Lemmas \3\ [4] [5] and 

The aim of this subsection is to prove Lemmas [3] HJ [5] and [6] 
We begin by deriving some auxiliary lemmas that will be used. 

Lemma 8: 

(a) For any 8 £ [-5, 3) and p > 

£ e -P(fc+«) 2 < e - P e 2 + / e -P- 2 dx . (88) 

(b) For any 6 £ R and p > 



U D2 = 
We define 



E 



-p(fc+e) 2 < e -p(e-Lei) 2 + Jl K1 



& D2 = d 2 X 2 +e(X 2 - [dX 2 ]*), 



which is statistically independent of (1 — a)Xi. Since 1 
the random variable [(a— 1)Xl]* is uniformly distributed over 



E 



P MP 

(89) 

Proof: In order to prove part (a) of the lemma, we write 

- 1 00 



,- P {k+eY = p-pe' 



e -rfW + £V 



■p{k+ey 



-Z„ 



We can therefore apply Lemma [3] and get 
l 



fe=i 



< - 

P 



27T/3 „ 3SNR 

— '— + 2e 5-. 
SNR 



= e " pe2 + E e- p(fc - 9)2 + J2 e- p{k+9)2 



(83) 



fc=i 



k=l 



(90) 



Case £>3 - Substituting c = 0, a = 1 and d = l into ( f79l 
gives 

= [-Wf 2 ]*. 
Applying Lemma [3] with 9 = gives 



We have 



P 00 x 
/ e~ px2 dx > rn 



fe=i 



E e " 

fc=i 



mm e 

-i+e,fc+e] 



-p(k+ey 



(91) 



fio3 < - + \/ -r^r + 2e 



(84) 



and 



p V SNR 

for any 6^0. The case b = is not interesting because it 
implies wj = wi and w 2 = w 2 , and an error does not occur. 

Case DA - We are left only with the case a = 1, c = 0, d > 
2 for which 



/9 poo °° 

e~ px ' ' dx = / e~ px2 dx > min 
-00 J-e j-^x^[k-i-6,k 



k=l 

00 

E< 

fc=i 



-p(k-ef 



(92) 



C/d4 - [d 2 X 2 + sX 2 -e [dX 2 ]*] ' 



(85) 



'We also used the fact that 7 , , 1 1 , = e 1 1 , . 
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Substituting (|9TJ and (O into (J90J yields 



Proof of Lemma \3} 



E 



e -P(fe+») 2 < 



k— — c 



= E 



E 



— SNK j j 2 

-, « u 



= 9 



e- px dx + / e~ px dx 
Je 

e- px2 dx, (93) 



< maxE \e~~ u 16 = 



max 



fc=0 

p-i 



which establishes the first part of the lemma. 
In order to prove part (b), we have 



max — > e v ' 

^ k=0 



1 \ - 

< max — } e 8 p 
6>eR p ^— ' 



oo oo 

e -p(fc+e) 2 = e -p(fe+Lei+(f- 

A;— — oo A;— — oc 



Lei)) 2 



(94) 



A- 



< max - > e 8 p 2 
ee [-0.5,0.5) P zJ^-i 



Letting = 9 - [ff] e [-§, f) and fe = k + \0\, we have 



f.f)) 
" (fc+e) 2 



(98) 



E « 

= — oo 



- P (fc+e)- 



- E ^ 



(fc+e) 2 



a i 

p \ 

_ i 



n-8p 2 \ 
12SNR / 



(99) 



2tt/3 
SNR ' 



< e 



-p0 2 



- px dx 



~ 2T 7^dx 



n 



(95) 



where d98l l follows from Lemma [9] and (|99l follows from 
part (b) of Lemma [8] and the fact that L 2 = 12. ■ 

Before proceeding to the proof of Lemma |4] we need to 
derive two more simple results. 

Lemma 10: Let T> be some constellation of finite cardinal- 
ity \T>\, with minimum distance 



where we have used part (a) of the lemma for the first 
inequality. ■ 

Lemma 9: For any 9 € K and a prime number p 



mm 



\X 1 - X 2 \ 



Then 



(100) 



[Zp + 9} 



p — 1 ~ 
= Z„ - ^— + 0, (96) 



od[-f,f)-^ 2 



where 9 = 9— [9~\ e [— |, |), and the notation = stands for 
equality between sets of points (constellations). 

Proof: 



E^ < E e-^ fc - d ™"+ ) 2 , 

a;£D k— — oo 

for some € KL 

Proof: Let us sort the points of 2? in ascending order by 

do-A- Mam <...< d -A_! < d < d +A 1 ...< d +A M ^, 



where 



do = argmin \x\ 

x£T> 



[Z p + 9] , 



Z P + L 6 *! ] mod [-§,-§) " 

Z J mod [-f ,f ) 



mod [-f,f) 



We have 


Mneg 




Mpo, 


E 


= ^° e -p(<i Q -A_ t ) 2 _, 


- e- pd ° 


f e -p(d +A fc ) 2 


l£P 


fe=l 
Af ncg 


M pos 


fe=i 




< e -p(d -fc-d m , n ) 2 


^E e ~ 


-p(do + fe-(i min ) 2 




fe=l 
oo 


k=0 





p-1 



(97) 



< E «- 

k= — oo 



p(do+fe-d min ) 2 



We are now ready to prove Lemma [3] 



Setting 9 = do the lemma is proved. ■ 
Remark 1: We note that the bound (llOOl i is rather loose, 
and is one of the weakest links in the chain of bounds we use 
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for obtaining an upper bound on the average pairwise error 
probability E(P eiPa j r ). 

Lemma 11: Let 



Proof of Lemma [4} 



E 



e « 



E 



X 



L 



= - fZ„ 

P 



E 



e ~— u \e = e 



<maxE|e-^ a \G = 6 



and let V = [(/ + e)X + 6}* where / £ Z p , and e, 6» <E K are 
arbitrary constants. The minimum distance in the constellation 
T> is lower bounded by 



p 

^ fe=0 
oo 

< max - y, e 



max 
6>eR p 



96R P 



(103) 



fc— — oc 

^ °° SNR-d 2 



rfmin > — min Z 

p zez p \{o} 



[Is] 



max — N 

0GR P 



Proof: The distance between any pair of distinct constel- 
lation points can be written as 



|[(/ + £ )z 1+ 0]*-[(/ + e )x 2 + 0]*| 

> | [[(f + e) Xl + 6}*-[(f + e)x 2 + 6}*}*\ 

= \[f{x 1 -x 2 ) + e{x 1 -x 2 )}*i (101) 



P \ 



M 1 



8?r 



SNR ■ d; 



2 

min 



87rp 2 



SNR-<5 2 (p,e)I. 2 



(104) 
(105) 



2tt/3 



<5 2 (p,e)SNR' 



where xi and x 2 are two distinct points in 



p p 



Letting 



xi = —xi e [1 P ] mod [_§,p) 



and 



x 2 = -x 2 e [l P ] mod [_|,|)« 



where (1103b follows from Lemma [10] (11041 follows from part 
(b) of Lemma [8] and ( 1105b from Lemma QT| ■ 

We use a similar technique for the proof of Lemma 

Proof of Lemma \5} Let 



Xi = yX tl for i = 1,2,3, 



such that X\, X 2 and X3 are three i.i.d. random variables uni- 

mod [-p/2,p/2)' 



formly distributed over [Z p ] , _,„ _ /rA . With this notation, 



can be written as 



we can further bound (110U as 



[/(xi - x 2 ) + e(xi - x 2 ) 
L 



Further, let 



P 

L 
> — 

P 



[f(xi - x 2 ) + e[xt - x 2 )] mod h | )g) 
[f(xi - x 2 ) + e(xi - x 2 )] mod 

X 2)] mod [-1,1) 



[E(Xi 



> — min Ze — \le~\\ 

P i6Z P \{0} 1 



> — min I ■ 

p iez p \{o} 



JM 



e = x 2 - x 3 



and note that for 8 £ [0, ±1, . . . , ±(p — 1)] we have 



Pr ( e = ,) = ^B<i. 



(102) 



Now, 



SNR TV 2 , 

\ e - — u = E 



where inequality ( 1102b is true since 0<|ii~X2|<p — 1. ■ 
Aided by Lemmas [10] and [TT] we can now prove Lemma [4] 



E 
p-i 

= E Pr ( e = 

e=-(p-i) 



p^ p 



SNR rr 2 , „ 

3-— 17 e = 



(106) 



(107) 



21 



For any value of 6, we have 

E 



SNR r r 2 

e 8 



= E 



=d [-*,«) 



1 2 

1 E 



e -f?(fe+(7e-L791)) 2 



< I f e -f?M-L7ei) 
V \ 



p 2 2n/3 
SNR 



2?r /3 , l e -^ (7 e-L T ei) 



(108) 

(109) 
(HO) 



SNR p 

where (1108b follows from Lemma [9] and ( 1109b follows from 
part (b) of Lemma [8] . 

We note that for 6 = 0, (111 01 > becomes 



E 



and for any 6 ^ 



16 = 



< 



2tt/3 1 
SNR + p' 



(111) 



E 



e s 



SNR p 

which follows directly from the definition of 5(p, 7) and the 
fact that |0| G Z p \{0}. 

Substituting (fT56t . ( ITTTb and lfTT2l into (fToTT i yields 



E 



e s 



SNR 



Proof of Lemma [6} 
The first step in proving the lemma is showing that for any 

r e Z p \{0, 1}, there exists at least one value of x E 

for which \u\ > ||e mod t _i 

For any value of x £ 

[ZJ , r p P s. We have 

I PI mod [—5,5) 



, define J 



-a; g 



> 



[/x + ex — e[rx]*}* 
L 

/x + £X-£[rxj mod[ _p )f) 

L 



fx + ex — e [rx] 
e(x~ [rx] mod [_!,!) 



od [-f,f) 



mod [— ^,^) 



We focus on the expression 

x — \rx] 



(113) 



(114) 



We show that for any value of2<r<p— 1 there exists 
a value of x G [Z p ] mod £ ^ for which (1114b equals 1. In 
order to see that, we observe that the equation 



X J mod [-§,!) 

- [(1 ~ r)x] 



od [-£,£) 



mod [-§,§) 



= 1, 



(115) 



has a (single) solution for any r G Z p \{l}. Further, since 

-(p-l),(p-l)], (116) 



mod [-§,§) € 



it can be deduced that for any r £ Z P \{1}, there exists a 
(single) value of x for which (II 14b equals either 1 or — (p— 1). 
Since (II 14b equals — (p — 1) only if 



p-1 



mod [-£,§) 



which is possible only for r = p — 1, we conclude that for 
any 2 < r < p — 1 there is a value of x, which we denote x 1 , 
for which (II 14b equals 1. Substituting x 1 into dl 131 ) yields 



(117) 



L 




L 






£ mod l-U) 


> — 


£ mod [-iij 


P 




P 





We are left with the case r = p — 1, for which 

x - [rx] mod [_!,!) = x - [-5] = 2x. 
Substituting (1118b into ( 1113b gives 

[2xe] mod r 1 



(118) 



H>- 



(1 12) It follows that for r = p — 1 and x = 1 



\u\ > - [2e] mod 



> 



mod 



.1 1) 

4 ' 4 ^ 



(119) 



Combining (II 17b and dl 19b , we conclude that for any r G 
Z p \{0, 1}, there exists at least one value of x for which 



M > - 

P 



(120) 



Now, since at least one of the equiprobable p possible values 



of U is bigger (in absolute value) than 



E 



e s 



E 



SNR rr2 

e § 



can be upper bounded by 

SNR jjl 
« u 



p ^ 



<P_± + l e ~^ 



P 



P 



p-1 1 
+ -e 

p p 



3 SNR 



(121) 



Appendix III 
An Example of a 3-User Interference Channel 
Power-Time Code 

In this section we introduce a coding scheme that utilizes 
both power back-off at the transmitters, and the time domain, 
in order to allow for perfect interference alignment (with some 
loss in the number of DoF). We illustrate the scheme by an 
example which is useful for the general 3-user interference 
channel and achieves 9/8 degrees of freedom out of the 3/2 
DoF afforded by the channel for almost all channel gains 
(see 0). 
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We consider the channel 



hn 


hi2 






tl22 


h23 


h.31 


hz2 





in 


1 


hi3 


''21 


h,22 


1 


1 


h 3 2 


^33 



We use the channel An times, in order to transmit 3 codewords 
of length n by each user. We refer to n consecutive channel 
uses as a frame. The actions taken by the transmitters and 
the receiver vary from frame to frame, as will be described 
in detail. All transmitters and receivers use the same linear 
codebook C of rate i? sy m and length n during all frames. The 
codeword transmitted by user k at frame t is denoted by Xfc^. 

We assume that hi3 > h\2, /122 > ^23 and /132 > /131. 
There is no loss of generality in this assumption, as the scheme 
we now describe can be easily modified for different ratios 
between the channels gains. For all frames, receiver 1 scales 
its observation by I//112, receiver 2 scales its observation by 
I//123 and receiver 3 scales its observation by I//131 such that 
the equivalent channel is 



H 



where hij = hij/h.12, h 2 j = h 2 j/h 2 3 and h 3j = h^j/h^i. 

We describe the operations taken by the transmitters and the 
receivers at each frame. 

1) Frame 1: User 1 transmits the codeword xi.i, user 2 
transmits the codeword X2 1 and user 3 transmits the 
codeword X3.1. 

User 3 scales its codeword by the factor 03 — l/h\3, 
and all other transmitters do not scale their codewords. 
The equivalent channel is thus 

/ hn I 1 
Hi = h 2 i hi2 l/hi3 
V 1 ^32 h 3S /hi 3 

Due to the perfect alignment at receiver 1, it can decode 
X11, the codeword transmitted by user 1. The other 
receivers cannot decode their codewords at this stage. 

2) Frame 2: User 1 transmits the codeword xi 2, user 2 
transmits the codeword X2,2 and user 3 transmits the 
codeword X3.2- 

User 1 scales its codeword by the factor a.\ = l/ft.21, 
and all other transmitters do not scale their codewords. 
The equivalent channel is thus 

(hu/h 2 i 1 hi3 
1 ~h 22 1 
1/^21 ^-32 h 33 

Due to the perfect alignment at receiver 2, it can decode 
X2,2, the codeword transmitted by user 2. The other 
receivers cannot decode their codewords at this stage. 

3) Frame 3: User 1 transmits the codeword xi 3, user 2 
transmits the codeword X2,3 and user 3 transmits the 
codeword x 3 3. 



User 2 scales its codeword by the factor 012 = l/^32> 
and all other transmitters do not scale their codewords. 
The equivalent channel is thus 



H3 = 



hn }l h 32 

fr-21 ^22//l32 
1 1 




Due to the perfect alignment at receiver 3, it can decode 
X3 3, the codeword transmitted by user 3. The other 
receivers cannot decode their codewords at this stage. 
4) Frame 4: In this frame, each user repeats a codeword it 
has already transmitted in one of the previous frames. 
Specifically, user 1 transmits the codeword xi.i, user 
2 transmits the codeword X2.2 and user 3 transmits the 
codeword X3.3. 

None of the users scale their codewords, such that the 
equivalent channel is H. Now, receiver k observes the 
signal 



Yfe,4 - ^2 hjk^-j,j 
3=1 



- Zfc, 



where z,t is the Gaussian noise present at receiver k. 
Since receiver k had already decoded the codeword xj; & 
in the k t h frame, it can subtract hkk x k,k from 5^4 which 
results in the equivalent two-user MAC channel 



3 

V 



Yfc,4 = Vfc,4 — hkk^-k,k — n jkXj,j + Zfc- 

User k can now decode the two codewords transmitted 
by the other users during the fourth frame. For instance, 
in this step, user 1 decodes the codewords X2.2 and X3 3. 
Now that user 1 has the side information X2,2 and X3 3, 
it can return to its observations from the second and 
the third frames. It can subtract from yi,2 the term 
X2,2, leaving it with a two-user MAC which allows it 
to decode Xi,2- In the same manner, it can subtract the 
term ^13X3,3 from y^a, leaving it with a two-user MAC 
which allows it to decode X13. 

The same procedure is done by each one of the other 
decoders. 

The described power-time code results in 3 different code- 
words, each with a rate that scales like !/4logSNR, that were 
decoded by each decoder. Taking into account the (symbol) 
rate of the power-time code which is 3/4 (since the fourth 
channel use is "wasted"), we get a sum rate that scales like 
1 1 log SNR, which means that the number of DoF is 9/8 . 
More importantly, using Theorem Q] we can find an achievable 
symmetric rate for this power-time code for any SNR. From 
our DoF analysis of llV-CI we know that for almost any channel 
realization there exist a certain value of SNR from which the 
described power-time code outperforms time sharing. 
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